Data Professional Survey Analysis

In this Power-bi project, we do an analysis of a Data Professional Survey that was done and try to draw some insights from these data. The survey was taken from a total of 630 respondents targeting specifically those working in the data field. Here is a step to step walk through of the project.

  1. Loading Data Into Power-bi
  2. The first step was loading data into Power-bi. Power-bi accepts data from multiple data sources. Our data from the survey was stored in an Excel Workbook and so we use that option to load data into Power-bi. Instead of loading the data directly into Power-bi, we transform it to open the data in Power-Query for Data Cleaning.

  3. Data Cleaning/Data Transformation
  4. After loading the data into Power-Query, I then cleaned the data to make it more usable. This involved:

    • Removing unwanted columns
      Fullscreen Image
    • Transforming the Job Title column specifically the 'other' entries in the column.
      Fullscreen Image
    • Transforming the Favourite Programming Language column specifically the 'other' entries in the column.
      Fullscreen Image
    • Transforming the Country column specifically the 'other' entries in the column. This is to simplify the analysis as the respondents were from many different countries
      Fullscreen Image
    • Transforming the Industry column specifically the 'other' entries in the column.
      Fullscreen Image
    • Creating a new column of Average Salary from the salary column with ranges.
      Fullscreen Image

  5. Count of Survey Takers
  6. We start our analysis by finding out how many people took the survey. We find there were a total of 630 survey takers. This is a good sample size to continue with our analysis and draw some meaningful insights that could actually be used for decision making by various stakeholders eg an employer or a person looking to enter the data field.

    Fullscreen Image

  7. Average Age of Respondents
  8. It was also helpful to know the average age of respondents. This can help us better understand some of the choices or answers the respondents gave among other reasons. The average age of survey takers was 29.87 years.

    Fullscreen Image

  9. Average Salary by Job Title
  10. The next stop was to determine which career or job title in the Data Field has the highest salary. We have 5 main career paths in the Data Field; Data Scientist, Data Analyst, Data Engineer, Data Architect and Database Developer, and try to compare their salaries.

    Doing this analysis, we find out that Data Scientist are paid the most averaging 94K USD while the least paid were the Database Developers averaging 33K USD. It is important to mention however, of the survery takers, most were Data Analyst and might be the more accurate representation of the real world average salary of Data analyst.

    Fullscreen Image

  11. Country of Respondents
  12. We then analyse the data to find out the country of Respondents. This will help us to filter down our analysis by country as some metrics are country based eg Salary.

    Fullscreen Image

  13. Work/Life Balance
  14. We then tried to find out whether the people working in the data field are happy with their work/life balance. This can help someone make a decision on whether to join the field if this is something that's important for them.

    Fullscreen Image

  15. Satisfaction with Salary
  16. We then tried to find out whether people are happy with the salaries they recieve on a scale of 10.

    Fullscreen Image

  17. Average Salary by Sex
  18. Finally, an important metric that various stakeholders can find useful was finding out the average salary by sex. This can help someone know whether there's any gender bias in the field. The analysis revealed little difference in the wage gap between the two genders with women earning slightly higher than their male counterparts in the sample taken.

    Fullscreen Image

  19. Creating a Dashboard
  20. Finally we take all the visualizations created and combine them into a dashboard. Here we can filter based on various metrics like country where the respondents live, job title etc and get more personalized analysis based on the insights you need from the data.

    Fullscreen Image

    To download and view the full project on GitHub, click here.